Expedia | Data Engineer - II Interview Experience | 2.5 YoE



Round 1 โ€“ Technical Screening

Format: 90-minute online assessment

DSA Questions

๐Ÿ”น Array-based Question

๐Ÿ”น Required efficient manipulation and traversal techniques.

๐Ÿ”น Focused on optimal time and space complexity.

Matrix-based Question (BFS)

๐Ÿ”น Problem involving Breadth-First Search (BFS) on a matrix.

๐Ÿ”น Tested understanding of graph traversal techniques and edge case handling.

SQL Question

A complex SQL problem involving:

๐Ÿ”น Multiple CTEs (Common Table Expressions)

๐Ÿ”น Sequential transformation of data

๐Ÿ”น Final output through joins and aggregations

Round 2 โ€“ SQL Deep Dive

Introduction

๐Ÿ”น General background discussion and overview of previous roles.

SQL Scenarios

Rolling Mean per Product

๐Ÿ”น Given a product table with dates, implement a rolling mean over 3 consecutive dates.

๐Ÿ”น Used window functions with UNBOUNDED PRECEDING.

CTE + Join + RANK()

๐Ÿ”น Two tables provided.

๐Ÿ”น Required: A CTE, Join operation, RANK() window function

๐Ÿ”น Follow-up: Modify solution to use a LEFT JOIN for inclusive results.

SQL Concepts Discussion

Indexing Types:

๐Ÿ”น Clustered vs Non-clustered

๐Ÿ”น Bitmap, B-Tree, etc.

Partitioning:

๐Ÿ”น Horizontal vs Vertical

๐Ÿ”น Partition pruning techniques

File Formats:

๐Ÿ”น CSV, Parquet, ORC

๐Ÿ”น When to use which based on performance and use case

Round 3 โ€“ Data Governance and Practices

Introduction

๐Ÿ”น Discussion around prior work experience and roles.

Behavioral & Conceptual Topics

Expedia Value โ€“ โ€œForce Simplicityโ€

Asked for a significant real-life instance where I:

๐Ÿ”น Reduced complexity

๐Ÿ”น Created clear, impactful outcomes

User-Centricity

๐Ÿ”น Building with user empathy

๐Ÿ”น Gathering feedback and improving data products

Data Governance

๐Ÿ”น Access control

๐Ÿ”น Data lineage

๐Ÿ”น Auditability

๐Ÿ”น Metadata management

Data Cleaning & BI Practices

๐Ÿ”น End-to-end cleaning pipelines before BI integration

๐Ÿ”น Dealing with nulls, duplicates, inconsistent formats

Challenges & Learnings

Past challenges related to:

๐Ÿ”น Pipeline maintenance

๐Ÿ”น Stakeholder alignment

๐Ÿ”น Data quality